Clustering Based Attribute subset Selection using Fast Algorithm
نویسندگان
چکیده
منابع مشابه
A Fast Clustering-based Feature Subset Selection Algorithm
The paper aims at proposing the fast clustering algorithm for eliminating irrelevant and redundant data. Feature selection is applied to reduce the number of features in many applications where data has hundreds or thousands of features. Existing feature selection methods mainly focus on finding relevant features. In this paper, we show that feature relevance alone is insufficient for efficient...
متن کاملA Review Of Fast Clustering-Based Feature Subset Selection Algorithm
In this paper we cover some reference paper and compared different algorithm on the basis of their performance and selection of data set. Where the efficiency concerns on the time evaluation of features selection, and the effectiveness is related to the quality of the subset of features selection. We analysis this report based on feature subset selection algorithm from the years of 1997 to 2013...
متن کاملCorrelation-based Attribute Selection using Genetic Algorithm
A Data Warehouse (DW) is a repository of information collected from multiple sources, stored under a unified schema, and that usually resides at a single site. DWs are constructed via a process of data cleaning, data integration, data transformation, data loading, and periodic data refreshing. Integration of data sources refers to the task of developing a common schema as well as data transform...
متن کاملA Fast Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data
Clustering which tries to group a set of points into clusters such that points in the same cluster are more similar to each other than points in different clusters, under a particular similarity metric. In the generative clustering model, a parametric form of data generation is assumed, and the goal in the maximum likelihood formulation is to find the parameters that maximize the probability (l...
متن کاملFast and Improved Feature subset selection Algorithm Based Clustering for High Dimensional Data
The Clustering is a method of grouping the information into modules or clusters. Their dimensionality increases usually with a tiny number of dimensions that are significant to definite clusters, but data in the unrelated dimensions may produce much noise and wrap the actual clusters to be exposed. Attribute subset selection method is frequently used for data reduction through removing unrelate...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal on Cybernetics & Informatics
سال: 2015
ISSN: 2320-8430,2277-548X
DOI: 10.5121/ijci.2015.4220